Overview
Brought to you by YData
Dataset statistics
| Number of variables | 10 |
|---|---|
| Number of observations | 1000 |
| Missing cells | 577 |
| Missing cells (%) | 5.8% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 337.7 KiB |
| Average record size in memory | 345.8 B |
Variable types
| Numeric | 3 |
|---|---|
| Categorical | 7 |
Credit amount is highly overall correlated with Duration | High correlation |
Duration is highly overall correlated with Credit amount | High correlation |
Saving accounts has 183 (18.3%) missing values | Missing |
Checking account has 394 (39.4%) missing values | Missing |
Reproduction
| Analysis started | 2024-12-14 21:14:39.541782 |
|---|---|
| Analysis finished | 2024-12-14 21:14:42.977431 |
| Duration | 3.44 seconds |
| Software version | ydata-profiling vv4.12.1 |
| Download configuration | config.json |
Variables
Age
Real number (ℝ)
| Distinct | 53 |
|---|---|
| Distinct (%) | 5.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 35.546 |
| Minimum | 19 |
|---|---|
| Maximum | 75 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 19 |
|---|---|
| 5-th percentile | 22 |
| Q1 | 27 |
| median | 33 |
| Q3 | 42 |
| 95-th percentile | 60 |
| Maximum | 75 |
| Range | 56 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 11.375469 |
|---|---|
| Coefficient of variation (CV) | 0.32002106 |
| Kurtosis | 0.59577957 |
| Mean | 35.546 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 1.0207393 |
| Sum | 35546 |
| Variance | 129.40129 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 27 | 51 | 5.1% |
| 26 | 50 | 5.0% |
| 23 | 48 | 4.8% |
| 24 | 44 | 4.4% |
| 28 | 43 | 4.3% |
| 25 | 41 | 4.1% |
| 30 | 40 | 4.0% |
| 35 | 40 | 4.0% |
| 36 | 39 | 3.9% |
| 31 | 38 | 3.8% |
| Other values (43) | 566 |
| Value | Count | Frequency (%) |
| 19 | 2 | 0.2% |
| 20 | 14 | 1.4% |
| 21 | 14 | 1.4% |
| 22 | 27 | |
| 23 | 48 | |
| 24 | 44 | |
| 25 | 41 | |
| 26 | 50 | |
| 27 | 51 | |
| 28 | 43 |
| Value | Count | Frequency (%) |
| 75 | 2 | 0.2% |
| 74 | 4 | |
| 70 | 1 | 0.1% |
| 68 | 3 | 0.3% |
| 67 | 3 | 0.3% |
| 66 | 5 | |
| 65 | 5 | |
| 64 | 5 | |
| 63 | 8 | |
| 62 | 2 | 0.2% |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.62 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | male |
|---|---|
| 2nd row | female |
| 3rd row | male |
| 4th row | male |
| 5th row | male |
Common Values
| Value | Count | Frequency (%) |
| male | 690 | |
| female | 310 |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| male | 690 | |
| female | 310 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1310 | |
| m | 1000 | |
| a | 1000 | |
| l | 1000 | |
| f | 310 | 6.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4620 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1310 | |
| m | 1000 | |
| a | 1000 | |
| l | 1000 | |
| f | 310 | 6.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4620 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1310 | |
| m | 1000 | |
| a | 1000 | |
| l | 1000 | |
| f | 310 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4620 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1310 | |
| m | 1000 | |
| a | 1000 | |
| l | 1000 | |
| f | 310 | 6.7% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 1 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 630 | |
| 1 | 200 | 20.0% |
| 3 | 148 | 14.8% |
| 0 | 22 | 2.2% |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | 630 | |
| 1 | 200 | 20.0% |
| 3 | 148 | 14.8% |
| 0 | 22 | 2.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 630 | |
| 1 | 200 | 20.0% |
| 3 | 148 | 14.8% |
| 0 | 22 | 2.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 630 | |
| 1 | 200 | 20.0% |
| 3 | 148 | 14.8% |
| 0 | 22 | 2.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 630 | |
| 1 | 200 | 20.0% |
| 3 | 148 | 14.8% |
| 0 | 22 | 2.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 630 | |
| 1 | 200 | 20.0% |
| 3 | 148 | 14.8% |
| 0 | 22 | 2.2% |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.287 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | own |
|---|---|
| 2nd row | own |
| 3rd row | own |
| 4th row | free |
| 5th row | free |
Common Values
| Value | Count | Frequency (%) |
| own | 713 | |
| rent | 179 | 17.9% |
| free | 108 | 10.8% |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| own | 713 | |
| rent | 179 | 17.9% |
| free | 108 | 10.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 892 | |
| o | 713 | |
| w | 713 | |
| e | 395 | |
| r | 287 | 8.7% |
| t | 179 | 5.4% |
| f | 108 | 3.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3287 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 892 | |
| o | 713 | |
| w | 713 | |
| e | 395 | |
| r | 287 | 8.7% |
| t | 179 | 5.4% |
| f | 108 | 3.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3287 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 892 | |
| o | 713 | |
| w | 713 | |
| e | 395 | |
| r | 287 | 8.7% |
| t | 179 | 5.4% |
| f | 108 | 3.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3287 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 892 | |
| o | 713 | |
| w | 713 | |
| e | 395 | |
| r | 287 | 8.7% |
| t | 179 | 5.4% |
| f | 108 | 3.3% |
Saving accounts
Categorical
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 183 |
| Missing (%) | 18.3% |
| Memory size | 54.4 KiB |
| little | |
|---|---|
| moderate | |
| quite rich | |
| rich | 48 |
Length
| Max length | 10 |
|---|---|
| Median length | 6 |
| Mean length | 6.4430845 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | little |
|---|---|
| 2nd row | little |
| 3rd row | little |
| 4th row | little |
| 5th row | quite rich |
Common Values
| Value | Count | Frequency (%) |
| little | 603 | |
| moderate | 103 | 10.3% |
| quite rich | 63 | 6.3% |
| rich | 48 | 4.8% |
| (Missing) | 183 | 18.3% |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| little | 603 | |
| rich | 111 | 12.6% |
| moderate | 103 | 11.7% |
| quite | 63 | 7.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 1372 | |
| l | 1206 | |
| e | 872 | |
| i | 777 | |
| r | 214 | 4.1% |
| h | 111 | 2.1% |
| c | 111 | 2.1% |
| m | 103 | 2.0% |
| o | 103 | 2.0% |
| d | 103 | 2.0% |
| Other values (4) | 292 | 5.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5201 | |
| Space Separator | 63 | 1.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 1372 | |
| l | 1206 | |
| e | 872 | |
| i | 777 | |
| r | 214 | 4.1% |
| h | 111 | 2.1% |
| c | 111 | 2.1% |
| m | 103 | 2.0% |
| o | 103 | 2.0% |
| d | 103 | 2.0% |
| Other values (3) | 229 | 4.4% |
Space Separator
| Value | Count | Frequency (%) |
| 63 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5201 | |
| Common | 63 | 1.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 1372 | |
| l | 1206 | |
| e | 872 | |
| i | 777 | |
| r | 214 | 4.1% |
| h | 111 | 2.1% |
| c | 111 | 2.1% |
| m | 103 | 2.0% |
| o | 103 | 2.0% |
| d | 103 | 2.0% |
| Other values (3) | 229 | 4.4% |
Common
| Value | Count | Frequency (%) |
| 63 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5264 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 1372 | |
| l | 1206 | |
| e | 872 | |
| i | 777 | |
| r | 214 | 4.1% |
| h | 111 | 2.1% |
| c | 111 | 2.1% |
| m | 103 | 2.0% |
| o | 103 | 2.0% |
| d | 103 | 2.0% |
| Other values (4) | 292 | 5.5% |
Checking account
Categorical
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 394 |
| Missing (%) | 39.4% |
| Memory size | 54.6 KiB |
| little | |
|---|---|
| moderate | |
| rich |
Length
| Max length | 8 |
|---|---|
| Median length | 6 |
| Mean length | 6.679868 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | little |
|---|---|
| 2nd row | moderate |
| 3rd row | little |
| 4th row | little |
| 5th row | moderate |
Common Values
| Value | Count | Frequency (%) |
| little | 274 | |
| moderate | 269 | |
| rich | 63 | 6.3% |
| (Missing) | 394 |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| little | 274 | |
| moderate | 269 | |
| rich | 63 | 10.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 817 | |
| e | 812 | |
| l | 548 | |
| i | 337 | |
| r | 332 | |
| o | 269 | 6.6% |
| m | 269 | 6.6% |
| d | 269 | 6.6% |
| a | 269 | 6.6% |
| c | 63 | 1.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4048 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 817 | |
| e | 812 | |
| l | 548 | |
| i | 337 | |
| r | 332 | |
| o | 269 | 6.6% |
| m | 269 | 6.6% |
| d | 269 | 6.6% |
| a | 269 | 6.6% |
| c | 63 | 1.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4048 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 817 | |
| e | 812 | |
| l | 548 | |
| i | 337 | |
| r | 332 | |
| o | 269 | 6.6% |
| m | 269 | 6.6% |
| d | 269 | 6.6% |
| a | 269 | 6.6% |
| c | 63 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4048 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 817 | |
| e | 812 | |
| l | 548 | |
| i | 337 | |
| r | 332 | |
| o | 269 | 6.6% |
| m | 269 | 6.6% |
| d | 269 | 6.6% |
| a | 269 | 6.6% |
| c | 63 | 1.6% |
Credit amount
Real number (ℝ)
High correlation 
| Distinct | 921 |
|---|---|
| Distinct (%) | 92.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3271.258 |
| Minimum | 250 |
|---|---|
| Maximum | 18424 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 250 |
|---|---|
| 5-th percentile | 708.95 |
| Q1 | 1365.5 |
| median | 2319.5 |
| Q3 | 3972.25 |
| 95-th percentile | 9162.7 |
| Maximum | 18424 |
| Range | 18174 |
| Interquartile range (IQR) | 2606.75 |
Descriptive statistics
| Standard deviation | 2822.7369 |
|---|---|
| Coefficient of variation (CV) | 0.86289032 |
| Kurtosis | 4.2925903 |
| Mean | 3271.258 |
| Median Absolute Deviation (MAD) | 1097.5 |
| Skewness | 1.9496277 |
| Sum | 3271258 |
| Variance | 7967843.5 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 1275 | 3 | 0.3% |
| 1262 | 3 | 0.3% |
| 1478 | 3 | 0.3% |
| 1393 | 3 | 0.3% |
| 1258 | 3 | 0.3% |
| 1199 | 2 | 0.2% |
| 2333 | 2 | 0.2% |
| 1295 | 2 | 0.2% |
| 1845 | 2 | 0.2% |
| 1474 | 2 | 0.2% |
| Other values (911) | 975 |
| Value | Count | Frequency (%) |
| 250 | 1 | |
| 276 | 1 | |
| 338 | 1 | |
| 339 | 1 | |
| 343 | 1 | |
| 362 | 1 | |
| 368 | 1 | |
| 385 | 1 | |
| 392 | 1 | |
| 409 | 1 |
| Value | Count | Frequency (%) |
| 18424 | 1 | |
| 15945 | 1 | |
| 15857 | 1 | |
| 15672 | 1 | |
| 15653 | 1 | |
| 14896 | 1 | |
| 14782 | 1 | |
| 14555 | 1 | |
| 14421 | 1 | |
| 14318 | 1 |
Duration
Real number (ℝ)
High correlation 
| Distinct | 33 |
|---|---|
| Distinct (%) | 3.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20.903 |
| Minimum | 4 |
|---|---|
| Maximum | 72 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 12 |
| median | 18 |
| Q3 | 24 |
| 95-th percentile | 48 |
| Maximum | 72 |
| Range | 68 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 12.058814 |
|---|---|
| Coefficient of variation (CV) | 0.57689396 |
| Kurtosis | 0.91978136 |
| Mean | 20.903 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 1.0941842 |
| Sum | 20903 |
| Variance | 145.41501 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=33)
| Value | Count | Frequency (%) |
| 24 | 184 | |
| 12 | 179 | |
| 18 | 113 | |
| 36 | 83 | |
| 6 | 75 | |
| 15 | 64 | 6.4% |
| 9 | 49 | 4.9% |
| 48 | 48 | 4.8% |
| 30 | 40 | 4.0% |
| 21 | 30 | 3.0% |
| Other values (23) | 135 |
| Value | Count | Frequency (%) |
| 4 | 6 | 0.6% |
| 5 | 1 | 0.1% |
| 6 | 75 | |
| 7 | 5 | 0.5% |
| 8 | 7 | 0.7% |
| 9 | 49 | 4.9% |
| 10 | 28 | 2.8% |
| 11 | 9 | 0.9% |
| 12 | 179 | |
| 13 | 4 | 0.4% |
| Value | Count | Frequency (%) |
| 72 | 1 | 0.1% |
| 60 | 13 | 1.3% |
| 54 | 2 | 0.2% |
| 48 | 48 | |
| 47 | 1 | 0.1% |
| 45 | 5 | 0.5% |
| 42 | 11 | 1.1% |
| 40 | 1 | 0.1% |
| 39 | 5 | 0.5% |
| 36 | 83 |
Purpose
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 56.3 KiB |
| car | |
|---|---|
| radio/TV | |
| furniture/equipment | |
| business | |
| education | |
| Other values (3) |
Length
| Max length | 19 |
|---|---|
| Median length | 15 |
| Mean length | 8.559 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | radio/TV |
|---|---|
| 2nd row | radio/TV |
| 3rd row | education |
| 4th row | furniture/equipment |
| 5th row | car |
Common Values
| Value | Count | Frequency (%) |
| car | 337 | |
| radio/TV | 280 | |
| furniture/equipment | 181 | |
| business | 97 | 9.7% |
| education | 59 | 5.9% |
| repairs | 22 | 2.2% |
| domestic appliances | 12 | 1.2% |
| vacation/others | 12 | 1.2% |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| car | 337 | |
| radio/tv | 280 | |
| furniture/equipment | 181 | |
| business | 97 | 9.6% |
| education | 59 | 5.8% |
| repairs | 22 | 2.2% |
| domestic | 12 | 1.2% |
| appliances | 12 | 1.2% |
| vacation/others | 12 | 1.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 1035 | |
| i | 856 | 10.0% |
| e | 757 | 8.8% |
| a | 746 | 8.7% |
| u | 699 | 8.2% |
| n | 542 | 6.3% |
| / | 473 | 5.5% |
| t | 457 | 5.3% |
| c | 432 | 5.0% |
| o | 375 | 4.4% |
| Other values (13) | 2187 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7514 | |
| Uppercase Letter | 560 | 6.5% |
| Other Punctuation | 473 | 5.5% |
| Space Separator | 12 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 1035 | |
| i | 856 | |
| e | 757 | |
| a | 746 | |
| u | 699 | |
| n | 542 | |
| t | 457 | 6.1% |
| c | 432 | 5.7% |
| o | 375 | 5.0% |
| d | 351 | 4.7% |
| Other values (9) | 1264 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 280 | |
| V | 280 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 473 |
Space Separator
| Value | Count | Frequency (%) |
| 12 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8074 | |
| Common | 485 | 5.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 1035 | |
| i | 856 | |
| e | 757 | |
| a | 746 | |
| u | 699 | 8.7% |
| n | 542 | 6.7% |
| t | 457 | 5.7% |
| c | 432 | 5.4% |
| o | 375 | 4.6% |
| d | 351 | 4.3% |
| Other values (11) | 1824 |
Common
| Value | Count | Frequency (%) |
| / | 473 | |
| 12 | 2.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8559 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 1035 | |
| i | 856 | 10.0% |
| e | 757 | 8.8% |
| a | 746 | 8.7% |
| u | 699 | 8.2% |
| n | 542 | 6.3% |
| / | 473 | 5.5% |
| t | 457 | 5.3% |
| c | 432 | 5.0% |
| o | 375 | 4.4% |
| Other values (13) | 2187 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 3.7 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | good |
|---|---|
| 2nd row | bad |
| 3rd row | good |
| 4th row | good |
| 5th row | bad |
Common Values
| Value | Count | Frequency (%) |
| good | 700 | |
| bad | 300 |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| good | 700 | |
| bad | 300 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 1400 | |
| d | 1000 | |
| g | 700 | |
| b | 300 | 8.1% |
| a | 300 | 8.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3700 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 1400 | |
| d | 1000 | |
| g | 700 | |
| b | 300 | 8.1% |
| a | 300 | 8.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3700 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 1400 | |
| d | 1000 | |
| g | 700 | |
| b | 300 | 8.1% |
| a | 300 | 8.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3700 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 1400 | |
| d | 1000 | |
| g | 700 | |
| b | 300 | 8.1% |
| a | 300 | 8.1% |
Interactions
Correlations
| Age | Checking account | Credit amount | Duration | Housing | Job | Purpose | Risk | Saving accounts | Sex | |
|---|---|---|---|---|---|---|---|---|---|---|
| Age | 1.000 | 0.000 | 0.026 | -0.036 | 0.271 | 0.155 | 0.064 | 0.113 | 0.059 | 0.278 |
| Checking account | 0.000 | 1.000 | 0.077 | 0.067 | 0.056 | 0.000 | 0.146 | 0.158 | 0.176 | 0.000 |
| Credit amount | 0.026 | 0.077 | 1.000 | 0.625 | 0.143 | 0.188 | 0.161 | 0.184 | 0.000 | 0.098 |
| Duration | -0.036 | 0.067 | 0.625 | 1.000 | 0.140 | 0.121 | 0.071 | 0.218 | 0.000 | 0.032 |
| Housing | 0.271 | 0.056 | 0.143 | 0.140 | 1.000 | 0.115 | 0.160 | 0.127 | 0.000 | 0.228 |
| Job | 0.155 | 0.000 | 0.188 | 0.121 | 0.115 | 1.000 | 0.143 | 0.000 | 0.028 | 0.073 |
| Purpose | 0.064 | 0.146 | 0.161 | 0.071 | 0.160 | 0.143 | 1.000 | 0.081 | 0.036 | 0.119 |
| Risk | 0.113 | 0.158 | 0.184 | 0.218 | 0.127 | 0.000 | 0.081 | 1.000 | 0.138 | 0.066 |
| Saving accounts | 0.059 | 0.176 | 0.000 | 0.000 | 0.000 | 0.028 | 0.036 | 0.138 | 1.000 | 0.000 |
| Sex | 0.278 | 0.000 | 0.098 | 0.032 | 0.228 | 0.073 | 0.119 | 0.066 | 0.000 | 1.000 |
Missing values
A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
Sample
| Age | Sex | Job | Housing | Saving accounts | Checking account | Credit amount | Duration | Purpose | Risk | |
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 67 | male | 2 | own | NaN | little | 1169 | 6 | radio/TV | good |
| 1 | 22 | female | 2 | own | little | moderate | 5951 | 48 | radio/TV | bad |
| 2 | 49 | male | 1 | own | little | NaN | 2096 | 12 | education | good |
| 3 | 45 | male | 2 | free | little | little | 7882 | 42 | furniture/equipment | good |
| 4 | 53 | male | 2 | free | little | little | 4870 | 24 | car | bad |
| 5 | 35 | male | 1 | free | NaN | NaN | 9055 | 36 | education | good |
| 6 | 53 | male | 2 | own | quite rich | NaN | 2835 | 24 | furniture/equipment | good |
| 7 | 35 | male | 3 | rent | little | moderate | 6948 | 36 | car | good |
| 8 | 61 | male | 1 | own | rich | NaN | 3059 | 12 | radio/TV | good |
| 9 | 28 | male | 3 | own | little | moderate | 5234 | 30 | car | bad |
| Age | Sex | Job | Housing | Saving accounts | Checking account | Credit amount | Duration | Purpose | Risk | |
|---|---|---|---|---|---|---|---|---|---|---|
| 990 | 37 | male | 1 | own | NaN | NaN | 3565 | 12 | education | good |
| 991 | 34 | male | 1 | own | moderate | NaN | 1569 | 15 | radio/TV | good |
| 992 | 23 | male | 1 | rent | NaN | little | 1936 | 18 | radio/TV | good |
| 993 | 30 | male | 3 | own | little | little | 3959 | 36 | furniture/equipment | good |
| 994 | 50 | male | 2 | own | NaN | NaN | 2390 | 12 | car | good |
| 995 | 31 | female | 1 | own | little | NaN | 1736 | 12 | furniture/equipment | good |
| 996 | 40 | male | 3 | own | little | little | 3857 | 30 | car | good |
| 997 | 38 | male | 2 | own | little | NaN | 804 | 12 | radio/TV | good |
| 998 | 23 | male | 2 | free | little | little | 1845 | 45 | radio/TV | bad |
| 999 | 27 | male | 2 | own | moderate | moderate | 4576 | 45 | car | good |